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Abstract 



Secular trends of the incidence of NCDs are especially important 
as they indicate changes of the risk profile of a population. The article 
describes a method for detecting secular trends in the incidence from a 
series of prevalence data - without requiring costly follow-up studies or 
running a register. After describing the theory, the method is applied 
to the incidence of diabetes in Denmark. 
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1 Introduction 

Setting up health information systems to monitor the evolving burden of non- 
communicable diseases (NCDs) and their risk factors, is one of the claims of 
the Moscow Declaration, which was approved by the First Global Ministerial 
Conference on Healthy Lifestyles and NCD control [I]. In that respect, secu- 
lar trends of the incidence of NCDs are especially important as they indicate 
changes of the risk profile of the population under consideration. Common 
ways to detect secular trends in the incidence are either performing a series 
of follow-up studies or running a register. Both approaches may be very 
costly and lead to a variety of practical problems. In contrast, a series of 
prevalence studies sometimes is much easier to accomplish. Based on illness- 
death model (IDM), this article describes a method for detecting trends in 
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the incidence of NCDs without a series of follow-up studies and without a 
register. 

The next section introduces the IDM and derives the theoretical background 
for the method. In the third section, the theory is applied to data from the 
National Diabetes Register in Denmark. The register observed an increasing 
diabetes incidence in 1995-2004. We show that the trend is detectable using 
the IDM and a series of prevalence data. The last section contains a summary. 

2 Illness-Death Model 

In modelling chronic (i.e., irreversible) diseases, often the three-state model 
(compartment model) in Figure [TJ is used. The numbers of persons in the 
states Normal and Disease are denoted by S and C. The transition intensities 
(synonymously: rates) between the states are: the incidence rate i and the 
mortality rates mo and mi of the healthy or the diseased, respectively. These 
rates generally depend on the calendar time t, the age a and in the case of 
the mortality mi also on the duration of the disease d. 




Figure 1: Chronic disease model with three states and the corresponding 
transition rates. People in the state Normal are healthy with respect to the 
disease under consideration. At onset of the disease, they change to state 
Disease. 
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Although the inclusion of the disease duration d is also possible, hereinafter it 
is assumed that mi does not depend on d. This article analytically describes 
the relationship between the prevalence of a chronic disease, the incidence 
and mortality rates. This problem has existed at least since 1934 [5], but 
so far has only been solved in special cases. The article [3j presents a brief 
review and further references. 

As in pp, we look for the numbers S(t, a) and C(t, a) of healthy and diseased 
persons in terms of differential equations, which can be derived from the 
disease model in Figure [TJ For the healthy persons we get the following 
initial value problem: 

(1) (d t + d a )S = - [m + i] S 

S(t — a, 0) = So(t — a). 

Here So(t — a) = S(t — a, 0) is the number of (healthy) newborns^ at calendar 
time t — a. The notation d x denotes the partial derivative with respect to 
x, x G {t, a}. The solution S(t,a) is 

(2) S(t, a) = S (t — a) exp I — / m (t — a + r, r) + i(t — a + r, r) dr 



which may be checked easily. 

The number C of diseased persons are described similarly: 

(3) (d t + d a )C = -mi C + iS 

C(t,0) = 0. 

The solution is 
(4) 



C(t,a) 



J i(t—6,a—6)S(t—S,a—6)expf—J m\{t — 5 + r, a — S + r)dr\ dd. 



Equation (jlj) allows the following interpretation: Starting from (t, a) at 5 
time units before, i.e., at (t — 8, a — 8), exactly i(t — 8, a — 8) S(t — 8, a — 8) 
persons newly enter state Disease. Until (t, a) the proportion 

exp I — / mi (t — 8 + r, a — 8 + r)dr 



1 This paper only considers diseases acquired after birth. 
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of those has survived. Integration over all possible < 5 < a yields the 
number of diseased persons at time (t,a). After applying the quotient rule 
to the age-specific prevalence 

C(t,a) 
P(t ' fl)= S(t,a) + C(t,a) 

and using ((Tj) and (j3J) it follows 

(5) (d t + d a )p= (1 -p) (i-p (mi -mo)) 

p(*,0) = 0. 

Remark 1. For t, a > wrat/i p(t, a) 7^ 1 it holds 

(6) a) = — + ^tt-^ ^ + p(t, a) (mi(t, a) - m (t, a)) . 

1 -p(t,a) 

Furthermore, the solution of © can be calculated directly via ([2]) and (j4j): 
fi(t — 6,a- 5) exp ( - f ^(t - 5 + r, a - 5 + t)(1t) dS 

V / 



(7) p{t,a) 



1 + j i(t - 5, a - 5) exp ( - J ^(t - 5 + r, a - 5 + r)dr^) d<5 
V / 



with \1/ := mi — m — i. 
This follows from 

r 

C(t,a) = / i(t — S, a — 8) S (t — a) 
Jo 

f<5 pa— 8 



— I mi(t — 5 + t, a — 5 + r)dr — / (m + i)(t — a + r, r)dr 
z(i — 5, a — 5) So(t — a) 
■ exp ^— J mi(t — 8 + t, a — S + r)dr — ^ (m + i) (t — a + r, r)dr 
+ / (m + i)(t — a + T,r)dT \ d8 

Ja-8 J 

S (t — a) exp ^— y (m + i)(t — a + r, r)dr^ 



\ 



The first part of the last expression equals S(t, a) and Equation (j7j) follows. 



The usefulness of equation (jTJ) is obvious: Given the incidence i(t, a) and 
mortalities m (t, a), mi(t, a), the prevalence p(t, a) can be calculated for all 
t,a > 0. 

So we can state 

Remark 2. The prevalence p(t, a) is independent from the number So of 
newborns. 

Remark 3. For t,a > it holds: < p(t, a) < 1. 
Remark 4. If for some a x ) £/ie integral 

7 M \ 

T(ti, ai) := / i(ti — 5, a\ — 5) exp I — / ^/(tx — 5 + t, a\ — 5 + r)dr d<5 
o V o / 

is lower than for (t 2 ,a 2 ) : T(ti,ai) < T(t 2 ,ct2); ^en it holds piti.ai) < 
p{^2i a 2)- This follows from observing that x i— > x /i+x, x > is strictly in- 
creasing. 



At the end of the section we introduce the relative mortality R(t,a). For 
(t, a) > with m (t, a) > 0, define 

R(t,a)- mi{t > a) 



mo(t, a) 

Now we have all the definitions and results for the next section. 



3 Diabetes in Denmark 

In the article [2] the age-specific prevalence of diabetes for men (and women) 
in Denmark in the period 1995-2007 is presented in great detail. The results 
are based on a complete survey of the Danish population (n > 5 million). 
Classifying a person as diabetic is done by combining different health regis- 
ters, which yields a sensitivity of more than 85% [21 p. 2188]. In this paper 
we confine ourselves to the male population in Denmark. The age-specific 
incidence rate i for 2004 is given for all age groups, but for the other years in 
the period 1995-2007 just relatively to 2004, averaged across all age groups; 
likewise, with the mortality m$ of the non-diabetic population. Mortality m\ 
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significantly depends on the disease duration [21 Fig. 4]. To the apply the 
model of the previous section model, the duration dependence has to be sup- 
pressed. This is done by an initialization step: The relative mortality R*(a) 
is calculated such that the observed incidence and the associated increase in 
prevalence from 1995 to 1996 are in agreement. Therefor, the Equation ([5j) 
is solved for mi. Then, R* is calculated by R* = m i/m . For the period 1996 
- 2004, this relative mortality is kept fixed and Equation (jSJ) is solved for % as 
in Remark [1] with m\ = R* m$. By doing so, the relative mortality R* for the 
period 1996-2004 is assumed to be independent from calendar time. Thus, 
we have a two-step approach: 

1. Initialization: Calculate R* by fitting the observed incidence rate in 
1995 and the increase in age-specific prevalence from 1995 to 1996. 

2. Application: Derivation of the incidence rates in 1996-2004 via Remark 
[T] mit m,i(t,a) = R*(a) mo (t,a). 

Remark 5. After initialization, we just use the mortality mo of the non- 
diabetic population and the prevalences p(t, a), t = 1996, . . . , 2004, for deriv- 
ing i(t, a), t = 1996, ...,2004. 

Figure [21 shows the results of applying the model to 2001 (circles). For 
comparison the observed incidence is shown (solid line). Obviously, the data 
are in good agreement. 

Now, the increase in the incidence can be examined. For some of the age 
groups, the incidence rate over calendar time is shown in Figure [3J In ad- 
dition, the regression lines and the corresponding correlation coefficients r 
are given. In all age groups there is a significant upward trend. The higher 
the age, the better the fit of the linear regression model. Table [H shows the 
numerical values for all age groups. 

The annual increase rates in the age groups (second column in Table [TJ are all 
greater than the corresponding value 5.3% reported in (21 p. 2190]. However, 
the reported increase of 5.3% refers to all persons (both sexes, all age groups). 
Hence, a direct comparison with the values of Table Q] is impossible. 

4 Summary 

In this work, a novel method for deriving trends in incidence from a sequence 
of prevalence studies is presented. With a view to the tremendous effort 
required by collecting incidence data, the novel method provides a simpler 
alternative. A typical application is the conversion of a sequence of telephone 
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Incidence of diabetes (Males, 2000/01) 




Age (years) 



Figure 2: Age-spefic incidence rate in 2001: observed (solid line) and recon- 
structed by the model (circles). 

surveys for the collection of age-specific prevalence of a chronic disease into 
a sequence of incidence data. 

As a first application, the method was used with data from the Danish Na- 
tional Diabetes Register. The directly observed secular trend in the incidence 
is visible by the new method as well. 

In the application to the Danish Diabetes Register, the relative mortality R* 
in 1995 has been extrapolated for the period 1996-2004. While this may be 
possible for a period of eight years is a word of caution is in order: 

Remark 6. The calendar time trend in mortality mo of the non-diabetic pop- 
ulation is usually much better known than the trend in mortality ni\ of the 
patients. The reason is that mo is surveyed on a demographic scale, while mi 
is investigated sporadically in epidemiological studies only. In epidemiology, 
one might try to link the time trend in m x to the time trend in m Q . The idea 
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Age group 45 to < 50 



Age group 50 to < 55 




1997 1999 2001 2003 1997 1999 2001 

Year Year 



Age group 55 to < 60 Age group 60 to < 65 




1997 1999 2001 2003 1997 1999 2001 2003 

Year Year 




Figure 3: Incidence rates over calendar time for some age groups. 

might be to measure a relative mortality R at some time t' < t and extrapo- 
late from t' to t and set a) = R(t', a) m (t, a). Indeed, time dependence 
of nil is enforced, but this approach may lead to a possibly unexpected in- 
crease in the prevalence. If the incidence i is independent oft and the family 
of functions t \-t m 0ta (t) := m (t,a) is decreasing for all a > 0, then, by 
Remark^ the function t y p a (t) := p(t,a) is monotonically increasing for 
all a > 0. This means: although the incidence i is remains unchanged by 
the calendar time, by increasing the life expectancy of the healthy ( decreasing 
m o,a), the prevalence increases. Extrapolating the relative mortality R from 
t' to t therefore must be viewed critically. 

Beside the presentation of the theoretical background, this work is little more 
than a feasibility study. There are two sources of limitations: 

1. Data: Due to the incomplete detection of diabetes cases and the short- 
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Age- 



Annual 

increase (%) 



Correlation- 
coefficient 



group 



40 - 44 
45 - 49 
50 - 54 
55 - 59 
60 - 64 
65 - 69 
70 - 74 
75 - 79 
80 - 84 



9.2 
9.6 
9.3 
8.8 
8.3 
8.0 
7.8 
7.9 
8.2 



0.76 
0.86 
0.90 
0.93 
0.96 
0.98 
0.99 
0.99 
0.99 



Table 1: Parameters of the secular trends in the incidence rates in the age 
groups. 

ened report of incidence trends (pooled for all persons), a direct com- 
parison between the observed and derived trends in the incidence is 
impossible. 

2. Model: Although it is evident that m x depends on the duration d of 
diabetes, this dependency is neglected. In additon, the relative mor- 
tality R* has been calculated for 1995, but has been extrapolated to 
1996-2004. 

Both inaccuracies interact, which makes a rigorous evaluation difficult. Thus, 
a systematic evaluation of the method based on a comprehensive simulation 
study is necessary. 
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